Method and apparatus for voice interactive messaging

ABSTRACT

An audio indication of a recipient for a message is received, the message to be sent by a user to the recipient. An electronic database is searched for the recipient. When the recipient is found in the electronic database, information is determined from the electronic database concerning the recipient. An audio prompt is formed including at least some of the information concerning the recipient that was obtained from the electronic database.

This application is a continuation of application Ser. No. 10/916,745, filed Aug. 11, 2004, entitled METHOD AND APPARATUS FOR INTERACTIVE VOICE MESSAGING, to Takeda et al., which itself claims the benefit of U.S. Provisional Patent Application No. 60/554,755, filed Mar. 18, 2004, entitled VOICE INTERACTIVE MESSAGING, to Takeda et al., both of which applications are fully incorporated herein by reference in their entireties.

BACKGROUND OF THE INVENTION

The present invention relates to composing messages. More specifically, the present invention relates to interactive voice messaging for composing messages.

In order to send a message via a mobile phone, a user most often will need to read a small screen, type on a generally awkward and small keypad, use a cryptic handwriting language, correct handwriting translation errors, and select one or more hard/soft buttons on the mobile phone. In general the process of sending a message via a mobile phone requires different user interaction techniques to accomplish a single message sending task.

Thus, a system and method for conveniently creating and sending messages is needed.

SUMMARY OF THE INVENTION

The different embodiments described herein address the above mentioned needs as well as other needs by providing a method and apparatus for voice interactive messaging.

One embodiment can be characterized as a method comprising receiving a request to compose a message; outputting a first prompt requesting a recipient for the message; receiving a first audible response to the first prompt, the first audible response including a recipient of the message; outputting a second prompt requesting a body for the message; receiving a second audible response to the second prompt, the second audible response including a body of the message; and composing the message.

Another embodiment is characterized as an electronic device for voice messaging comprising an audible guide module for audibly prompting a user for recipient information and for audibly prompting a user for message information; a voice recognition module for interpreting received responses to the audible prompt for the recipient information and the audible prompt for the message information; and a message compiler module for creating a message from the responses to the audible prompt for the recipient information and the audible prompt for the message information.

A subsequent embodiment includes an electronic device comprising means for receiving a request to compose a message; means for outputting a first prompt requesting a recipient for the message; means for receiving a first audible response to the first prompt, the first audible response including a recipient of the message; means for outputting a second prompt requesting a body for the message; means for receiving a second audible response to the second prompt, the second audible response including a body of the message; and means for composing the message.

Still another embodiment is characterized as an electronic device for voice messaging comprising an audible guide module for audibly prompting a user for message information; a voice recognition module for interpreting received responses to the audible prompt for the message information; and a message compiler module for creating a message from the response to the audible prompt for the message information.

BRIEF DESCRIPTION OF THE DRAWINGS

The above and other aspects, features and advantages of the present invention will be more apparent from the following more particular description thereof, presented in conjunction with the following drawings, wherein:

FIG. 1 is a block diagram illustrating a voice messaging system in accordance with one embodiment;

FIG. 2 is a block diagram illustrating a voice messaging system in accordance with an alternative embodiment;

FIGS. 3A-3C are a flow diagram illustrating a method of voice messaging in accordance with one embodiment; and

FIG. 4 is a flow diagram illustrating a method of voice messaging in accordance with another embodiment.

Corresponding reference characters indicate corresponding components throughout the several views of the drawings. Skilled artisans will appreciate that elements in the figures are illustrated for simplicity and clarity and have not necessarily been drawn to scale. For example, the dimensions, sizing, and/or relative placement of some of the elements in the figures may be exaggerated relative to other elements to help to improve understanding of various embodiments of the present invention. Also, common but well-understood elements that are useful or necessary in a commercially feasible embodiment are often not depicted in order to facilitate a less obstructed view of these various embodiments of the present invention. It will also be understood that the terms and expressions used herein have the ordinary meaning as is usually accorded to such terms and expressions by those skilled in the corresponding respective areas of inquiry and study except where other specific meanings have otherwise been set forth herein.

DETAILED DESCRIPTION

The following description is not to be taken in the limiting sense, but is made merely for the purpose of describing the general principles of the invention. The scope of the invention should be determined with reference to the claims.

Referring to FIG. 1 a block diagram is shown illustrating a voice messaging system in accordance with one embodiment. Shown is an electronic device 100, a sound recorder 102, a sound database 104, a comparing module 106, an audible guide module 108, a recorded guide module 110, a speaker 112, a message composer 114, a contact database 116, a multimedia database 118, and a message 120.

The electronic device 100 includes the sound recorder 102, the sound database 104, the comparing module 106, the audible guide module 108, the recorded guide database 110, the speaker 112, the message composer 114, the contact database 116, and the multimedia database 118. The sound recorder 102, the sound database 104, the comparing module 106, the audible guide module 108, the recorded guide database 110, the message composer 114, the contact database 116, and the multimedia database 118 are implemented, in one embodiment, using a processor and one or more memory devices. The processor controls the operation of the electronic device and controls the access and the operation of the one or more memory devices.

The electronic device 100 is configured to compose and send messages of different types (e.g., voicemail, email and instant messaging) in response to a user audibly interacting with the electronic device 100. Preferably, the electronic device 100 audibly prompts the user for components or pieces of the message in order to compose the message.

In operation, the sound recorder 102 receives a voice input and converts the voice input into sound data. The comparing module 106 compares the sound data to data stored in the sound database 104. The sound database 104 stores previously recorded words and phrases for a user. In this manner, the electronic device 100 is trained by the user to recognize the user's voice patterns. This is known as speaker dependent voice recognition. In order to populate the sound database 104, the user optionally will go through a training process where commands are spoken by the user and recorded into the sound database 104.
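By way of illustration only, the comparison performed by the comparing module 106 might be sketched as a nearest-template lookup. The fixed-length feature vectors, the Euclidean distance measure, the `max_distance` threshold, and the function name `match_command` are all assumptions made for this sketch; the description does not specify how sound data is represented or compared.

```python
import math
from typing import Dict, List, Optional


def match_command(sample: List[float],
                  templates: Dict[str, List[float]],
                  max_distance: float = 1.0) -> Optional[str]:
    """Return the label of the closest user-recorded template, or None.

    `templates` stands in for the sound database: each label maps to a
    feature vector recorded during training (hypothetical representation).
    """
    best_label: Optional[str] = None
    best_dist = math.inf
    for label, template in templates.items():
        # Euclidean distance between the new sample and the stored template
        dist = math.dist(sample, template)
        if dist < best_dist:
            best_label, best_dist = label, dist
    # Report a match only when the closest template is close enough
    return best_label if best_dist <= max_distance else None
```

In this sketch a return value of None corresponds to no match being found, in which case no match information would be passed on to the audible guide module.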

Upon finding a match between the sound data and data stored in the sound database 104, information indicating a match is sent to the audible guide module 108. The audible guide module 108 analyzes the information sent from the comparing module 106 and determines the next piece of information needed from the user to compose a message. The audible guide module 108 then sends a signal to the recorded guide module 110. The recorded guide module 110 outputs a request for information through the speaker 112. The output, in one embodiment, is a request for a recipient of a message. The electronic device 100 then waits for a response from the user.

The sound recorder 102 next records the response from the user as sound data. The comparing module 106 compares the recorded sound data to information in the sound database 104. The recorded sound data is information that will be used to compose a message, for example, a recipient, a subject, a body, a message type, or an attachment. The electronic device treats the response as corresponding to the information requested in the previous prompt for information. For example, if the previous request was “Who would you like to send a message to?” then the recorded sound data is treated as recipient information and used to gather information from the contact database 116. Similarly, if the previous request was “What file would you like to attach to the message?” then the recorded sound data is used to attach a file from the multimedia database 118. In another example, if the previous request was “Please state your message?” the recorded sound data is used to compose the body of the message 120. This will be further described and illustrated herein with reference to FIGS. 3A-3C.
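The rule that a response is interpreted according to the preceding prompt might be sketched as follows. This is a minimal illustration, not the implementation of the described device: the `Draft` class, its `ask` and `answer` methods, and the slot names are hypothetical, and the responses are shown as already-recognized strings rather than sound data.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional

# Prompt wording taken from the examples above; the slot names are assumed.
PROMPTS = {
    "recipient": "Who would you like to send a message to?",
    "attachment": "What file would you like to attach to the message?",
    "body": "Please state your message?",
}


@dataclass
class Draft:
    slots: Dict[str, str] = field(default_factory=dict)
    pending: Optional[str] = None  # slot requested by the most recent prompt

    def ask(self, slot: str) -> str:
        """Remember which piece of information is being requested."""
        self.pending = slot
        return PROMPTS[slot]

    def answer(self, response: str) -> None:
        """Store the response under whatever the last prompt asked for."""
        if self.pending is None:
            raise ValueError("no prompt is outstanding")
        self.slots[self.pending] = response
        self.pending = None
```

For example, calling `ask("recipient")` and then `answer("John Smith")` files the response as recipient information, without the response itself having to say what kind of information it is.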

The body of the message can be delivered to the recipient in a number of different ways. For example, in one embodiment, the body of the message is stored as a sound file and sent as an attachment in an email. Alternatively, the body of the message is converted into text and sent to the recipient, for example, as an instant message or an email. The message composer 114 utilizes all of the responses to the requests for information in order to assemble the final message. The message is then sent to the recipient utilizing, for example, a cellular telephone system and a cellular network. In a preferred embodiment, the electronic device is a cellular telephone; in alternative embodiments, however, the electronic device is, for example, a desktop computer, a laptop computer, a PDA, or a tablet PC.

Referring to FIG. 2, a block diagram is shown illustrating a voice messaging system in accordance with an alternative embodiment. Shown is an electronic device 200, a sound recorder 202, a sound database 204, a messaging engine and voice interface module 206, an audible guide module 208, a recorded guide module 210, a speaker 212, a message composer 214, a contact database 216, a multimedia database 218, and a message 220.

The electronic device 200 is configured to compose and send messages of different types (e.g., voicemail, email and instant messaging) in response to a user audibly interacting with the electronic device 200. Preferably, the electronic device 200 audibly prompts the user for components or pieces of the message in order to compose the message.

In operation, the sound recorder 202 receives a voice input and converts the voice input into sound data. The messaging engine and voice interface module 206 interprets the sound data and determines what action to take based upon the analysis of the sound data. The sound database 204 stores information needed for a speaker independent voice recognition system. In this manner, the device does not need to be trained by the user to recognize the user's voice patterns as the device has the capability to recognize any voice input through analysis of the sound data. This is known as speaker independent voice recognition.

After analyzing the sound data and data stored in the sound database 204, the messaging engine and voice interface module 206 determines how the electronic device 200 should respond to the user input. For example, if the user would like to send a message, the messaging engine and voice interface module 206 directs the audible guide module 208 to request additional information from the user. The audible guide module 208 then retrieves a request from the recorded guide database 210 and the request is then output through the speaker 212. The output, in one embodiment, is a request for a recipient of a message. The electronic device 200 then waits for a response from the user. The sound recorder 202 next records the response from the user as sound data. The messaging engine and voice interface module 206 uses the recorded sound data and determines a recipient for the message. The recipient for the message will be used by the message composer 214 to create a message.

The electronic device 200 continues to output audible requests for all of the data needed to compose a message and records the replies from a user as sound data. The recorded sound data is information that will be used to compose a message, for example, a recipient, a subject, a body, a message type, or a filename of an attachment. The electronic device treats the response as corresponding to the information requested in the previous prompt for information. For example, if the previous request was “Who would you like to send a message to?” then the recorded sound data is treated as recipient information. Similarly, if the previous request was “What file would you like to attach to the message?” then the recorded sound data is used to determine an appropriate file to attach. In another example, if the previous request was “Please state the body of your message?” the recorded sound data is treated as the body of the message.

Referring next to FIGS. 3A-3C, a flow diagram is shown illustrating a method of voice messaging. In the illustrative embodiment, the method is performed through interaction with an electronic device. For example, the electronic device is, in alternative embodiments, a cellular telephone, a personal digital assistant, a desktop computer, a laptop computer, or a tablet personal computer (PC).

In step 300, a user interacts with an electronic device in order to invoke a messaging application. For example, the user issues a voice command or presses a button (soft or hard key) of the electronic device. In step 302, the electronic device recognizes the user's request and invokes a messaging application. The electronic device next proceeds to audibly guide the user through a message creation session in step 304.

In step 306, the electronic device requests a recipient of the message. For example, the device will audibly output “To who would you like to send a message?” through, for example, a speaker on the electronic device. The electronic device then awaits a response (e.g., John Smith) from the user. In step 308, the user speaks the recipient's name (e.g., John Smith), which the electronic device receives and records. In step 310, the electronic device searches an address book for the recorded response. For example, the electronic device searches for “John Smith” in the address book. Continuing the example of “John Smith” being the recipient, in step 312, the electronic device gathers information for John Smith's account and determines the next type of information needed to compose a message. For example, the electronic device determines the types of messages that John Smith can receive and also optionally determines which devices John Smith currently has access to or prefers to be contacted at.
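One possible reading of steps 310 and 312 is sketched below. The decision rule (ask again for the recipient when no account is found, ask for a message type when the account lists more than one, otherwise proceed to the body) is an assumption made for illustration, as are the address book layout, the function name `next_information_needed`, and the sample entries.

```python
from typing import Dict, List


def next_information_needed(spoken_name: str,
                            address_book: Dict[str, List[str]]) -> str:
    """Look up the recipient and decide what to ask the user for next.

    `address_book` maps a lower-cased name to the message types that the
    recipient can receive (hypothetical layout).
    """
    methods = address_book.get(spoken_name.strip().lower())
    if methods is None:
        return "recipient"      # no account found: ask for the recipient again
    if len(methods) > 1:
        return "message type"   # several ways to reach the recipient
    return "body"               # only one way: go straight to the body
```

With an address book listing three message types for John Smith, this sketch would lead to a prompt about the message type, in keeping with the example prompt of step 314.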

The electronic device then, in step 314, collects voice data from the sound database and composes an audible prompt. For example, “John Smith is now available at mobile phone, office email and instant messenger (IM). The default message is IM. Is this okay?” The electronic device then awaits a response from the user. In an alternative embodiment, the electronic device simply chooses a type of message to be sent based upon, for example, the preferred message type of the recipient. This feature will be described herein below in greater detail with reference to FIG. 4. In step 320, the user responds, for example, “Yes” or “No” and the electronic device records and processes the response. In the example shown, the user responds by saying “No.” Next, in step 322, the electronic device audibly guides the user for a desired message type. In step 324, the electronic device directs the user interaction by outputting the audible prompt “choose the addressee for your message. Say Yes for your choice.” In step 326, the electronic device identifies a first messaging method for the recipient of the message and in step 328 outputs the audible prompt “Mobile Phone.” The electronic device, in step 330, determines that “yes” was not spoken by the user and proceeds to identify a second messaging method for the recipient. Following, in step 332, the electronic device outputs the audible prompt “Office Email” and waits for a response. In step 334, the user speaks “Yes” and the electronic device, in step 336, records the user response, stores the response as sound data, and composes an audible confirmation for the user. In step 338, the electronic device audibly outputs “message will be sent to John Smith's office email.”
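The selection loop of steps 326 through 334 might be sketched as follows. The `speak` and `listen` callables are hypothetical stand-ins for the speaker output and the recognized user response; the sketch assumes that any response other than “yes” moves on to the next messaging method.

```python
from typing import Callable, Iterable, Optional


def choose_method(methods: Iterable[str],
                  speak: Callable[[str], None],
                  listen: Callable[[], str]) -> Optional[str]:
    """Offer each messaging method in turn until the user says "yes"."""
    for method in methods:
        speak(method)                            # e.g. "Mobile Phone"
        if listen().strip().lower() == "yes":
            return method                        # user accepted this method
    return None                                  # no method was accepted
```

In the example above, the user stays silent after “Mobile Phone” and says “Yes” after “Office Email,” so the loop would return “Office Email” without ever offering the third method.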

In step 340, the electronic device audibly guides the user by prompting the user for a body of the message and in step 342 outputs an audible prompt requesting a body of the message that states “please speak your message now.”

In step 344, the user speaks the body of the message. The electronic device then awaits a response from the user and records the response. In step 346, the device records the message and waits for an indication that the body of the message is complete (e.g., two seconds of silence indicates the end of the message in step 350). The electronic device, in steps 348 and 350, then audibly confirms that the message is complete by outputting “Is your message completed?” from the speaker. In step 356, upon receiving a user response of “Yes” the electronic device outputs an audible confirmation that the message is being sent in step 358. For example, in step 360, the electronic device outputs “Your message is being sent to John Smith's office e-mail” to confirm that all of the information needed to send the message has been received.
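The end-of-message indication might be detected as sketched below, purely as an illustration. The sketch assumes the recorder delivers one energy value per fixed-length audio frame; the frame length, the energy threshold, and the name `record_until_silence` are assumptions, and only the two-second silence period comes from the example above.

```python
from typing import Iterable, List


def record_until_silence(frames: Iterable[float],
                         frame_seconds: float = 0.1,
                         silence_seconds: float = 2.0,
                         threshold: float = 0.01) -> List[float]:
    """Collect frames until a run of quiet frames lasts `silence_seconds`."""
    needed = max(1, round(silence_seconds / frame_seconds))
    recorded: List[float] = []
    quiet_run = 0
    for energy in frames:
        recorded.append(energy)
        # Count consecutive quiet frames; any loud frame resets the count
        quiet_run = quiet_run + 1 if energy < threshold else 0
        if quiet_run >= needed:
            # Drop the trailing silence and stop recording
            return recorded[:-quiet_run]
    return recorded  # input ended before a long enough silence occurred
```

A pause shorter than the silence period does not end the recording in this sketch, which allows the user to hesitate briefly in the middle of a message.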

In step 354, if the message was not completed, the electronic device will resume recording the message. Next in step 362, the electronic device composes the message using the recipient, type of message and body of the message gathered from the user. In one embodiment, the body of the message (i.e., the recorded sound data) is converted into text, while in another embodiment, the response is sent to the recipient as an audio file or audible message. For example, an email message can include the body with a text message or the email could include an audio file as an attachment. Lastly, in step 364, the electronic device audibly outputs “Your message has been sent to John Smith” to confirm the message was sent to the recipient. Each time the user speaks a response the electronic device processes the response according to whether the electronic device is a speaker independent or speaker dependent voice recognition system. For example, in a speaker dependent system, the sound data from the user's response is compared to a database containing pre-recorded words and phrases spoken by the user.

Referring to FIG. 4, a flow diagram is shown illustrating a method of voice messaging in accordance with one embodiment.

First in step 400, an electronic device receives a request to compose a message. For example, a user selects a messaging function on the electronic device through a menu option or a messaging button. Alternatively, a user can audibly request to send a message. The electronic device employs either a speaker dependent or speaker independent voice recognition system. For a speaker dependent system, the electronic device will compare the audible request to send a message (converted to sound data) to a database of prerecorded words or phrases in order to recognize the request. For a speaker independent voice recognition system, the system will recognize what the user says and proceed without the need to store pre-recorded user-specific phrases into the database.

Next in step 402, the electronic device outputs a first prompt requesting a recipient for the message. In a preferred embodiment, the electronic device outputs an audible prompt that requests the recipient of the message. Alternatively, the device can output the request for the recipient by placing the prompt on a display screen of the electronic device.

Next in step 404, the electronic device receives a first audible response to the first prompt, the first audible response including a recipient of the message. For example, the electronic device receives the name “John Smith” that was spoken by the user at a microphone and records the response as sound data.

Next in step 406, the electronic device outputs a second prompt requesting a body for the message. In a preferred embodiment, the electronic device outputs an audible prompt that requests the body of the message. Alternatively, the device can output the request for the body of the message by placing the prompt on a screen of the electronic device.

In step 408, the electronic device receives a second audible response to the second prompt, the second audible response including a body of the message.

Following, in step 410, the electronic device composes the message. For example, the message is composed for the intended recipient including the body of the message. In one example, an email is created and sent to the recipient's email address. The email can contain an attachment that includes sound data of the message body or the email can include a text portion that has been created by the electronic device by converting the sound data into text. Alternatively, the electronic device can leave a voice mail for the recipient or send the recipient an instant message.
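Steps 402 through 410 might be sketched end to end as follows. This is a simplified illustration under stated assumptions: `prompt` and `listen` are hypothetical callables standing in for the audible output and the recognized response, the prompt wording is borrowed from the examples given earlier, and the `Message` type is invented for the sketch.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class Message:
    recipient: str
    body: str


def compose_message(prompt: Callable[[str], None],
                    listen: Callable[[], str]) -> Message:
    """Walk through the two prompts and assemble the message."""
    prompt("Who would you like to send a message to?")  # step 402
    recipient = listen()                                # step 404
    prompt("Please speak your message now.")            # step 406
    body = listen()                                     # step 408
    return Message(recipient=recipient, body=body)      # step 410
```

Sending the composed message, whether as an email, a voice mail, or an instant message, is left outside this sketch.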

In an optional step, the electronic device outputs a prompt requesting a message type for the message. For example, the device can prompt the user as to whether to send a voice message or an email. The electronic device then receives an audible response including a type of message to be sent. In an alternative embodiment, the electronic device determines what type of message to send based upon a current status of the recipient or a preferred contact mode for the recipient. One method and system for identifying a current status of a recipient, or what type of message they would currently prefer to receive is described in U.S. patent application Ser. No. 10/872,993, filed Feb. 2, 2004 entitled “AUTOMATIC USER DEVICE PRESENCE REGISTRATION SYSTEM,” which is fully incorporated herein by reference. Therefore, in one embodiment, the electronic device automatically determines what type of message will be sent to the recipient and does not need to prompt the user for the type of message to send.

When the electronic device automatically determines what type of message to send to the recipient, the electronic device will also automatically determine the best form for the body of the message. For example, in an email message, the body could be converted to text if the recipient is at a work computer. However, if the recipient is only available on their PDA, the body could be an audio file attachment the user can listen to, as reading the text may be more difficult on the small screen of a PDA.
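The choice of body form in the example above might be sketched as a simple rule. The device labels, the return values, and the function name `choose_body_form` are hypothetical, and the rule encodes only the work computer and PDA example given here.

```python
from typing import Iterable


def choose_body_form(available_devices: Iterable[str]) -> str:
    """Pick text when a work computer is available, otherwise audio."""
    devices = {d.strip().lower() for d in available_devices}
    if "work computer" in devices:
        return "text"              # easy to read on a full-size screen
    return "audio attachment"      # e.g. recipient is only on a PDA
```

A fuller implementation would presumably draw the list of available devices from the presence information discussed above rather than from a hard-coded label.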

In another aspect of the present invention, the electronic device can be used to update the user's availability and preferred contact type. Preferably, the user can audibly interact with the electronic device in order to update their availability and preferred contact type. As described before, a system for AUTOMATIC USER DEVICE PRESENCE REGISTRATION is described in U.S. application Ser. No. 10/872,993.

While the invention herein disclosed has been described by means of specific embodiments and applications thereof, other modifications, variations, and arrangements of the present invention may be made in accordance with the above teachings other than as specifically described to practice the invention within the spirit and scope defined by the following claims. 

1. A method comprising: receiving an audio indication of a recipient for a message, the message to be sent by a user to the recipient; searching an electronic database for an existing electronic account of the recipient, the account populated with information, the information not being created or interactively provided by the user sending the audio indication, the information including an existing first informational component and an existing second informational component; subsequently, when the account of the recipient is found in the electronic database, retrieving the first informational component and the second informational component from the account; subsequently automatically analyzing the first informational component and the second informational component from the account of the recipient and, based upon the analyzing, automatically determining a variable and non-fixed, next type of information that is needed to compose the message, the next type of information not being interactively provided by the user and being different from the first informational component and the second informational component; subsequently forming an audio prompt including content that relates to the variable, next type of information that is needed to compose the message, the entire format of the entire prompt being customizable and changeable based upon the determined variable, next type of information and the identity of the recipient, the prompt not being part of a predetermined script or series of prompts.
 2. The method of claim 1 further comprising presenting the audio prompt to the user.
 3. The method of claim 2 further comprising receiving an audio response to the prompt.
 4. The method of claim 3 further comprising guiding the user through a plurality of inquiries and receiving information from the user based upon the guiding.
 5. The method of claim 1 wherein the electronic database comprises an electronic address book.
 6. The method of claim 1 wherein the information comprises a message type.
 7. The method of claim 1 further comprising receiving a request to compose a message.
 8. The method of claim 7 further comprising asking the user for the recipient of the message.
 9. An apparatus comprising: an interface having an input and an output, the interface configured to receive an audio indication of a recipient for a message at the input, the message to be sent by a user to the recipient; a controller coupled to the interface, the controller configured to search an electronic database for an existing account of the recipient identified in the audio indication, the account populated with information, the information not being created or interactively provided by the user sending the audio indication, and when the account of the recipient is found in the electronic database, determine selected information from the account concerning the recipient, the information including an existing first informational component and an existing second informational component, the controller configured to automatically analyze the first informational component and the second informational component information from the account and, based upon analysis, automatically determine a variable and non-fixed, next type of information needed to compose the message, the controller further configured to form an audio prompt at the output of the interface, the prompt including content that relates to the variable, next type of information needed to compose the message, the next type of information not being interactively provided by the user and being different from the first informational component and the second informational component, the entire format of the entire prompt being customizable and changeable based upon the determined variable, next type of information and the identity of the recipient, the prompt not being part of a predetermined script or series of prompts.
 10. The apparatus of claim 9 wherein the audio prompt is presented to the user at the output of the interface.
 11. The apparatus of claim 10 wherein an audio response to the prompt is received at the input of the interface.
 12. The apparatus of claim 11 wherein the controller is further configured to guide the user through a plurality of inquiries and information from the user is received at the input of the interface as a result of the inquiries.
 13. The apparatus of claim 9 wherein the electronic database comprises an electronic address book.
 14. The apparatus of claim 9 wherein the information comprises a message type.
 15. The apparatus of claim 9 wherein a request to compose a message is received at the input of the interface.
 16. The apparatus of claim 15 wherein the controller is further configured to ask the user for the recipient of the message. 